🗣️ CMU Pronouncing

Phonetic Dictionaries, Speech Synthesis, Linguistic Resources, Audio Processing

MultiVox: Benchmarking Voice Assistants for Multimodal Interactions
arxiv.org·7h
🎙️Whisper
What can we expect of LLMs as Software Engineers?
chelseatroy.com·1d
Text Parsing
I'm Switching to Python and Actually Liking It
cesarsotovalero.net·3h·Discuss: Hacker News
🌀Brotli Internals
Grammatical Structure and Grammatical Variations in Non-Metric Iranian Classical Music
arxiv.org·7h
🎼Computational Musicology
Students, here are 5 key things to know when learning how to train large language models
techradar.com·17h
💻Local LLMs
Bibliographical cornucopia for linguists, part 1
languagelog.ldc.upenn.edu·1d
📼Cassette Linguistics
Supporting SENĆOTEN Language Documentation Efforts with Automatic Speech Recognition
arxiv.org·7h
🎙️Whisper
Mistral launches Voxtral speech recognition model
theregister.com·15h
🎙️Whisper
UGC-VideoCaptioner: An Omni UGC Video Detail Caption Model and New Benchmarks
arxiv.org·7h
📊Learned Metrics
HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training
arxiv.org·7h
🌳Context free grammars
[P] Help with Contrastive Learning (MRI + Biomarkers) – Looking for Guidance/Mentor (Willing to Pay)
reddit.com·21h·Discuss: r/MachineLearning
🎵Audio ML
Towards Spatial Audio Understanding via Question Answering
arxiv.org·1d
🎵Audio ML
Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition
arxiv.org·1d
🎙️Whisper
Unsupervised Learning NO. 489
newsletter.danielmiessler.com·1d
🗜️LZW Variants
BENYO-S2ST-Corpus-1: A Bilingual English-to-Yoruba Direct Speech-to-Speech Translation Corpus
arxiv.org·1d
ABNF Extensions
Let AI Tune Your Voice Assistant
towardsdatascience.com·1d
🎙️Whisper
Porting a Cross-Disciplinary Text Mining Course Online Using Innovative Engagement Techniques
hackernoon.com·20h
🤖Grammar Induction
The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
arxiv.org·1d
🎵Audio ML
THAI Speech Emotion Recognition (THAI-SER) corpus
arxiv.org·1d
👂Psychoacoustic Coding
Language Models for Adult Service Website Text Analysis
arxiv.org·7h
Text Parsing